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AMENDMENT TO THE CLAIMS 

1. (CURRENTLY AMENDED) A computer-readable medium having computer-executable 
instructions for performing steps for building a symbol table for storing sort weights for a 
plurality of linguistic symbols used in a plurality of languages supported by a computer system, 
comprising: 

constructing the symbol table to contain a list of code points each uniquely identifying 
one of the symbols, and a sort weight for the symbol identified by said each code 
point; 

providing a plurality of compression tables, each compression table pertaining to one of 
the supported languages and having a compression type identifying a number of 
symbols in a given compression and containing compressions of symbols of that 
compression type, each compression being a grouping of two or more symbols 
treated as a single unit -sort element for purposes of linguistic sortin g such that an 
order of a given compression in the linguistic sorting is determined based on a 
compression type of the given compression, a first of the two or more symbols in 
the given compression and a predefined order of symbols ; 

for each code point in the symbol table, sorting the compression tables , based on the sort 
elements, and te-identifying a highest compression type euf -for compressions 
beginning with the symbol identified by said each code point; and 

storing in the symbol table a tag for each code point to indicate said highest compression 
type for said each code point. 

2. (ORIGINAL) A computer-readable medium as in claim 1, wherein the code points are 
assigned to the symbols according to the Unicode standard. 
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3. (ORIGINAL) A computer-readable medium as in claim 1, wherein the tag for each code point 

is stored as a portion of the sort weight of the symbol identified by said each code point. 

4. (ORIGINAL) A computer-readable medium as in claim 3, wherein the sort weight of the 
symbol identified by said each code point comprises a case weight value, and wherein the tag for 
said each code point is stored as part of the case weight value for said each code point. 

5. (ORIGINAL) A computer-readable medium as in claim 1, further comprising computer- 
executable instructions for performing steps of sorting compressions in each of the compression 
tables based on combinations of code points of the compressions in said each compression table. 

6. (CURRENTLY AMENDED) A method of building a symbol table for storing sort weights for 
a plurality of linguistic symbols used in a plurality of languages supported by a computer system, 
comprising: 

constructing the symbol table to contain a list of code points each uniquely identifying 
one of the symbols, and a sort weight for the symbol identified by said each code 
point; 

providing a plurality of compression tables, each compression table pertaining to one of 
the supported languages and having a compression type and containing 
compressions of symbols of that compression type, the compression type 
identifying a number of symbols in a compression, and each compression being a 
grouping of two or more symbols treated as a single ^HHt -sort element for purposes 
of lingusitic linguistic sorting; 

for each code point in the symbol table, sorting the compression tables to order the 
compression and to identify a highest compression type for compressions , the 

order of the compressions being peifomied by ordering compressions based on a 
first of the two or more symbols and then ordering the compressions based on 
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compression types, beginning with the symbol identified by said each code point; 

and 

storing a tag in the symbol table for each code point to indicate said highest compression 
type for said each code point. 

7. (ORIGINAL) A method as in claim 6, wherein the code points are assigned to the symbols 
according to the Unicode standard. 

8. (ORIGINAL) A method as in claim 6, wherein the tag for each code point is stored as a 
portion of the sort weight of the symbol identified by said each code point. 

9. (ORIGINAL) A method as in claim 8, wherein the sort weight of the symbol identified by said 
each code point comprises a case weight value, and wherein the tag for said each code point is 
stored as part of the case weight value for said each code point. 

10. (ORIGINAL) A method as in claim 6, further including the step of sorting compressions in 
each of the compression tables based on combinations of code points of the compressions in said 
each compression table. 

11. (CURRENTLY AMENDED) A computer-readable medium having computer-executable 
instructions for performing steps for a computer search program to carry out a linguistic sorting 
operation, comprising: 

receiving an input string containing a plurality of linguistic symbols letters used in a given 
language; 

for a first symbol letter in a combination of symbols letters in the input string, referencing 
a sjonbol table to obtain a highest compression type for compressions beginning 
with said first symbol letter . each compression being a grouping of two or more 
symbols letter treated as a single unit -sort element for purposes of linguistic 
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sorting and the compression type identifying a number of symbols letters in a 
given compression, the symbol table having a list of code points each uniquely 
identifying a symbol letter and a sort weight for the symbol letter identified by 
said each code point; 

performing a binary search through each of a plurality of compression tables containing 
compressions for the given language to find a matching compression that matches 
said combination of symbols letters in the input string, wherein the plurality of 
compression tables are searched in a descending order of compression types of the 
compression tables starting with a compression table having a compression type 
equal to said highest compression type for said first symbo U etter . 

12. (CURRENTLY AMENDED) A computer-readable medium as in claim 11, wherein the 
compressions in each of the compression tables are sorted according to code points for symbols 
letters forming the compressions. 

13. (CURRENTLY AMENDED) A computer-readable medium as in claim 12, wherein each 
code point in the symbol table includes a tag indicating a highest compression type for said each 
code point, and wherein said step of referencing retrieves the tag for the code point identifying 
said first symbo U etter . 

14. (ORIGINAL) A computer-readable medium as in claim 13, wherein the tag for each code 
point in the symbol table is stored as a portion of the sort weight for said each code point. 

15. (CURRENTLY AMENDED) A computer-readable medium as in claim 11, wherein the code 
points in the symbol table are assigned to symbols letters according to die-a_Unicode standard. 

16. (CURRENTLY AMENDED) A computer-readable medium as in claim 11, wherein the 
computer-executable instructions for performing a binary search form a_module that is called for 
searching each of the compression tables. 
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17. (PREVIOUSLY AMENDED) A computer-readable medium as in claim 11, having further 
computer-executable instructions for storing a sort weight for the matching compression. 



18. Cancelled. 

19. Cancelled. 

20. Cancelled. 

21. Cancelled. 

22. Cancelled. 

23. Cancelled. 

24. Cancelled. 



